1 


Compressive  channel  estimation  and  tracking  for 
large  arrays  in  mm  wave  picocells 

Zhinus  Marzi,  Dinesh  Ramasamy  and  Upamanyu  Madhow,  Fellow,  IEEE 

{zh_marzi,  dineshr,  madhow}@ece . ucsb . edu 


Abstract — We  propose  and  investigate  a  compressive  architec¬ 
ture  for  estimation  and  tracking  of  sparse  spatial  channels  in 
millimeter  (mm)  wave  picocellular  networks.  The  base  stations 
are  equipped  with  antenna  arrays  with  a  large  number  of 
elements  (which  can  fit  within  compact  form  factors  because 
of  the  small  carrier  wavelength)  and  employ  radio  frequency 
(RF)  beamforming,  so  that  standard  least  squares  adaptation 
techniques  (which  require  access  to  individual  antenna  elements) 
are  not  applicable.  We  focus  on  the  downlink,  and  show  that 
“compressive  beacons,”  transmitted  using  pseudorandom  phase 
settings  at  the  base  station  array,  and  compressively  processed 
using  pseudorandom  phase  settings  at  the  mobile  array,  pro¬ 
vide  information  sufficient  for  accurate  estimation  of  the  two- 
dimensional  (2D)  spatial  frequencies  associated  with  the  direc¬ 
tions  of  departure  of  the  dominant  rays  from  the  base  station, 
and  the  associated  complex  gains.  This  compressive  approach 
is  compatible  with  coarse  phase-only  control,  and  is  based  on 
a  near-optimal  sequential  algorithm  for  frequency  estimation 
which  approaches  the  Cramer  Rao  Lower  Bound.  The  algorithm 
exploits  the  geometric  continuity  of  the  channel  across  successive 
beaconing  intervals  to  reduce  the  overhead  to  less  than  1% 
even  for  very  large  (32  x  32)  arrays.  Compressive  beaconing 
is  essentially  omnidirectional,  and  hence  does  not  enjoy  the  SNR 
and  spatial  reuse  benefits  of  beamforming  obtained  during  data 
transmission.  We  therefore  discuss  system  level  design  considera¬ 
tions  for  ensuring  that  the  beacon  SNR  is  sufficient  for  accurate 
channel  estimation,  and  that  inter-cell  beacon  interference  is 
controlled  by  an  appropriate  reuse  scheme. 

Index  Terms — Compressive,  mm  wave,  60  GHz,  picocells,  RF 
beamforming 


I.  Introduction 

The  explosive  growth  in  demand  for  wireless  mobile  data, 
driven  by  the  proliferation  of  ever  more  sophisticated  hand¬ 
helds  creating  and  consuming  rich  multimedia,  calls  for  orders 
of  magnitude  increases  in  the  capacity  of  cellular  data  net¬ 
works  [1].  Millimeter  wave  communication  from  picocellular 
base  stations  to  mobile  devices  is  a  particularly  promising 
approach  for  meeting  this  challenge  because  of  two  reasons. 
First,  there  are  huge  amounts  of  available  spectrum,  enabling 
channel  bandwidths  of  the  order  of  GHz,  1-2  orders  of  mag¬ 
nitude  higher  than  those  in  existing  systems  at  lower  carrier 
frequencies.  Indeed,  channel  bandwidths  could  potentially 
increase  even  further  with  advances  in  transceiver  technology 
such  as  bandwidth/power  consumption/linearity  tradeoffs  for 
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Fig.  1:  Picocellular  network  deployed  along  an  urban  canyon 


ultra  high-speed  analog  electronics,  and  speed/precision/power 
consumption  tradeoffs  for  analog-to-digital  converters.  Sec¬ 
ond,  the  small  carrier  wavelength  enables  the  realization  of 
highly  directive  steerable  arrays,  with  a  large  number  of 
antenna  elements,  in  compact  form  factors,  thus  significantly 
enhancing  spatial  reuse.  In  this  paper,  we  address  fundamental 
signal  processing  challenges  associated  with  channel  estima¬ 
tion  and  tracking  for  such  large  arrays,  placed  within  the 
context  of  system  design  for  a  mm  wave  picocellular  network. 

While  the  signal  processing  and  system  design  concepts 
presented  here  are  of  rather  general  applicability,  our  numer¬ 
ical  results  are  for  a  particular  setting  that  we  feel  has  great 
promise,  as  also  discussed  in  some  of  our  prior  publications 
[2],  [3].  We  propose  to  employ  the  60  GHz  unlicensed  band 
for  base  station  to  mobile  communication  in  outdoor  picocells: 
an  example  urban  canyon  deployment  is  depicted  in  Figure 
1  (discussed  further  in  Section  III).  At  the  base  station,  we 
consider  very  large  32  x  32  arrays  (such  1000-element  arrays 
are  still  only  palm- sized  at  a  carrier  wavelength  of  5  mm) 
targeting  the  long  term,  as  well  as  “moderately  sized”  8x8 
arrays  (which  can  fit  within  an  area  of  about  half  a  square 
inch)  which  are  currently  realizable.  Note  that  16-element 
arrays  were  reported  several  years  ago  [4],  and  are  already 
deployed  in  existing  60  GHz  products,  while  32-element  arrays 
have  been  prototyped  [5].  We  assume  that  mobile  devices  are 
equipped  with  smaller  4x4  antenna  arrays.  We  focus  on 
downlink  60  GHz  communication,  with  the  goal  of  enabling 
base  station  arrays  to  perform  transmit  beamforming  towards 
mobile  devices,  despite  the  challenges  posed  by  mobility  and 
blockage  (which  occurs  more  easily  at  smaller  wavelengths). 
We  do  not  count  on  reciprocity.  The  uplink  could  be  a  standard 
LTE  or  WiFi  link  at  lower  carrier  frequencies,  used  both  for 
uplink  data  (not  modeled  here)  and  feedback  for  enabling 
spatial  channel  estimation  at  the  transmitter. 

Multiple  antenna  systems  at  lower  carrier  frequencies  have 
a  relatively  small  number  of  elements,  each  with  its  own 
radio  frequency  (RF)  chain.  This  provides  control  of  the 
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Fig.  2:  Channel  sounding  scheme:  The  transmitter  repeats  the 
M  transmit  beacons  L  times  so  that  receivers  can  measure  the 
channel  Vij  corresponding  to  the  ith  setting  at  the  transmitter 
and  the  jth  at  the  receiver. 


individual  baseband  signals  associated  with  each  element,  en¬ 
abling  sophisticated  adaptation,  including  frequency-selective 
spatiotemporal  processing  (e.g.,  per  subcarrier  beamforming 
in  OFDM  systems).  This  approach  does  not  scale  when  we 
have  a  large  number  of  antenna  elements  packed  into  a  tiny 
form  factor.  Instead,  we  consider  RF  beamforming,  in  which 
a  common  baseband  signal  is  routed  to/from  the  antenna 
elements,  and  we  can  only  control  the  amplitude  and  phase  for 
each  element.  Indeed,  we  go  even  further,  assuming  that  the 
amplitude  for  each  element  is  fixed,  and  that  we  can  only  apply 
coarse  four-phase  control  for  each  element.  Standard  least 
squares  array  adaptation  and  channel  estimation  techniques, 
which  require  access  to  the  baseband  signals  associated  with 
each  element,  do  not  apply  in  this  setting.  Instead,  we  consider 
here  a  compressive  approach  which  exploits  the  sparsity  of  the 
mm  wave  channel,  so  that  relatively  few  measurements  are 
required  for  channel  estimation  despite  the  large  number  of 
array  elements. 

Contributions:  Our  contributions  are  summarized  as  follows: 
Architecture:  We  propose  a  novel  architecture  in  which  base 
stations  send  out  compressive  beacons,  with  a  different  set 
of  pseudorandom  phases  used  to  transmit  each  beacon.  Each 
mobile  measures  the  complex  gains  associated  with  each 
beacon  compressively,  using  pseudorandom  control  of  the 
phases  of  its  receive  array.  The  scheme,  described  in  more 
detail  later,  is  depicted  in  Figure  2:  the  base  station  sends  M 
beacons,  repeated  L  times,  which  permits  the  mobile  to  use 
L  different  settings  of  its  own  array  to  measure  the  associated 
complex  gains.  Each  mobile  feeds  back  appropriately  chosen 
functions  of  its  measurements  to  the  base  stations  on  the  uplink 
(which  can  be  a  standard  LTE  link).  Each  base  station  use  this 
information  to  estimate  and  track  the  dominant  paths  to  each 
mobile  that  it  receives  feedback  from. 

Algorithms:  For  the  regular  two-dimensional  (2D)  arrays  con¬ 
sidered  here,  directions  of  arrival/departure  map  to  2D  spatial 
frequencies.  The  base  station  estimates  the  spatial  frequencies 
to  each  mobile  using  a  simple  sequential  algorithm,  shown 


to  be  near-optimal  (in  terms  of  approaching  the  Cramer-Rao 
Bound)  in  related  publications.  The  algorithm  exploits  the 
geometric  continuity  of  the  channel  across  successive  beacon¬ 
ing  intervals  to  reduce  the  required  number  of  compressive 
measurements. 

System  Design:  While  we  do  not  provide  a  complete  system 
design  centered  around  our  compressive  strategy,  we  do  pro¬ 
vide  preliminary  results  addressing  some  of  the  most  important 
issues.  We  show  that  the  overhead  incurred  by  our  beaconing 
scheme  is  very  small  (less  than  1%).  Furthermore,  while  com¬ 
pressive  beacons  are  essentially  omnidirectional  (in  contrast 
to  the  highly  directive  beams  employed  for  communication), 
the  link  budget  suffices  for  accurate  channel  estimation,  and  a 
simple  beacon  reuse  strategy  suffices  to  control  inter-beacon 
interference  across  picocells. 

II.  Background  and  Related  Work 

There  is  a  significant  body  of  recent  literature  on  the  poten¬ 
tial  for  mm  wave  communication  for  next  generation  mobile 
cellular  networks  [6],  [7],  [8],  [9],  [10],  [3].  For  these  outdoor 
settings,  most  of  the  attention  has  focused  on  bands  other  than 
60  GHz;  for  example,  [10]  studies  blockage  probabilities  and 
achievable  throughput  based  on  measurements  in  the  28,  38, 
71-76  and  81-86  GHz  bands.  However,  as  pointed  out  in  our 
prior  work  [3],  60  GHz  has  immense  potential  at  the  relatively 
short  ranges  of  interest  in  urban  picocells,  with  the  oxygen 
absorption  characteristic  of  this  band  only  having  a  marginal 
impact  on  the  link  budget  (e.g.,  only  1.6  dB  at  100m).  While 
studies  such  as  [7],  [8],  [9],  [10]  focus  on  coarse-grained 
channel  statistics  (and  their  implications  on  system  design  and 
performance),  our  focus  here  is  on  signal  processing  for  fine¬ 
grained  channel  estimation. 

We  assume  that  the  channel  is  well  described  by  a  relatively 
small  number  of  discrete  rays  with  delays  and  angles  of 
arrival/departure  taking  values  in  a  continuum.  The  key  contri¬ 
bution  of  this  paper  is  to  provide  a  super-resolution  framework 
for  estimating  and  tracking  these  rays,  with  model-based 
estimation  allowing  us  to  go  beyond  (spatial)  bandwidth  based 
resolution  limits.  To  the  best  of  our  knowledge,  the  existing  lit¬ 
erature  on  mm  wave  measurements  does  not  attempt  to  super¬ 
resolve  channels  in  this  fashion,  hence  we  do  not  know,  for 
example,  whether  the  continuous  power-delay  profiles  reported 
in  [10]  are  consistent  with  a  parsimonious  channel  model  such 
as  ours.  However,  preliminary  experimental  results  [11],  which 
use  compressive  measurements  to  successfully  recover  power- 
angle  profiles  for  a  controlled  experiment  (a  small  number 
of  reflectors  in  an  anechoic  chamber),  indicate  that  a  simple 
ray-based  model  like  ours  may  well  suffice.  Validating  this 
assertion  would  require  application  of  the  more  sophisticated 
compressive  estimation  techniques  discussed  here,  as  opposed 
to  the  standard  basis  pursuit  algorithms  employed  in  [1 1].  Note 
that  existing  models  for  simulation-based  evaluations,  such  as 
the  channel  models  standardized  for  IEEE  802.  llad  indoor 
60  GHz  channels,  typically  assume  more  complex  models 
which  are  variants  of  the  Saleh- Valenzuela  model,  with  a 
number  of  clusters,  each  consisting  of  multiple  closely  spaced 
rays.  Such  cluster-based  models  could  be  motivated  by  the 
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roughness  of  reflecting  surfaces  such  as  walls,  but  they  have 
not  been  experimentally  demonstrated.  While  these  issues  fall 
beyond  the  scope  of  the  present  paper,  which  aims  to  make  a 
fundamental  contribution  to  signal  processing  for  sparse  spatial 
channels,  a  sustained  effort  in  measurement-based  validation 
of  our  model  and  approach  is  a  critically  important  topic  for 
future  work. 

An  alternative  approach  to  spatial  channel  estimation  with 
RF  beamforming  is  exhaustive  search,  but  its  overhead  does 
not  scale  for  the  large  arrays  of  interest  to  us.  Hierarchical 
codebook  search  is  more  efficient  [12],  [13],  [14],  [15],  but 
it  cannot  guarantee  overhead  reduction  in  a  multiuser  setting. 
Furthermore,  the  multi-resolution  beam  patterns  used  in  hier¬ 
archical  search  require  more  than  one  RF  chain  [15],  and  these 
methods  only  provide  binned  estimates  of  path  directions. 
In  contrast,  the  method  presented  here  provides  fine-grained 
estimates  while  using  only  a  single  RF  chain. 

A  hardware  enhancement  to  pure  RF  beamforming  as 
considered  here  (where  a  single  RF  chain  serves  all  antenna 
elements)  is  to  employ  hybrid  analog-digital  beamforming, 
with  a  number  of  RF  chains  smaller  than  the  number  of 
antenna  elements.  We  may  term  this  an  array  of  subarrays, 
with  RF-level  control  for  subarrays,  and  digital  processing 
of  subarray  outputs.  Such  an  approach  is  used  in  [15],  [16] 
for  spatial  channel  estimation.  Our  work  shows,  however, 
that  a  single  RF  beamformed  array  suffices  for  this  purpose. 
Of  course,  arrays  of  subarrays  are  certainly  required  for 
more  advanced  functionalities  such  as  multiuser  MIMO  [17], 
spatial  multiplexing  [18],  [19],  [20],  [21],  [22],  and  spatial 
diversity  [20],  [23],  [24].  Integrating  the  compressive  approach 
proposed  in  this  paper  within  an  array  of  subarrays  architecture 
is  an  interesting  area  for  future  work. 

The  present  paper  builds  on  our  prior  conference  papers  on 
compressive  array  adaptation  [25],  [2],  but  goes  well  beyond 
them  in  several  respects.  In  addition  to  a  more  detailed  devel¬ 
opment  of  the  analytical  framework  underlying  our  estimation 
algorithm,  we  now  explicitly  model  the  receive  array  at  the 
mobile,  which  requires  a  generalization  of  the  beaconing  and 
feedback  strategy.  We  also  address  system  level  design  for 
compressive  tracking  in  far  greater  detail,  discussing  link 
budget  and  overhead,  and  accounting  for  inter-cell  beacon 
interference.  Our  initial  work  on  compressive  array  adaptation 
[25]  subsequently  led  to  a  general  framework  for  compressive 
estimation  [26],  [27],  which  identify  the  isometries  required 
to  preserve  fundamental  bounds  such  as  the  Ziv-Zakai  (ZZB) 
and  Cramer-Rao  (CRB),  and  use  the  relationship  between 
these  bounds  to  provide  criteria  for  determining  the  minimum 
number  of  compressive  measurements  required  to  preserve  ge¬ 
ometry  and  to  permit  accurate  parameter  estimation  based  on 
a  signal  corrupted  by  an  AWGN.  We  now  adapt  these  general 
results  here  to  develop  guidelines  for  system-level  parameter 
choices.  We  have  also  pointed  out  recently  [28],  [29]  that  the 
ideas  behind  our  algorithm  apply  to  the  fundamental  problem 
of  sparse  modeling  using  a  mixture  of  sinusoids,  which  has  a 
range  of  applications,  including  super-resolution  in  the  context 
of  radar  and  (time  domain)  multipath  channel  estimation. 
The  setting  in  the  present  paper  is  more  complicated  than  in 
[28],  [29]  due  to  the  use  of  compressive  weights  at  both  the 


transmitter  and  the  receiver,  but  there  are  a  common  set  of 
key  ideas,  including  orthogonal  matching  pursuit  with  Newton 
refinement,  and  a  Constant  False  Alarm  Rate  (CFAR)  stopping 
criterion. 

III.  System  Model 

Given  the  high  demand  for  wireless  data  in  dense  urban 
environments,  we  focus  our  modeling  and  performance  eval¬ 
uation  on  the  urban  canyon  setting  depicted  in  Figure  1,  with 
streets  flanked  by  buildings  on  both  sides.  Picocellular  base 
stations  are  deployed  on  lampposts  in  a  zig-zag  pattern  on  both 
sides  of  the  street.  We  consider  mm  wave  transmission  on  the 
downlink  (for  beaconing  and  downlink  data)  and  LTE  or  WiFi 
at  lower  carrier  frequencies  on  the  uplink  (for  feedback  and 
uplink  data).  In  terms  of  channel  estimation  and  tracking,  this 
could  be  viewed  as  a  worst-case  assumption,  since  two-way 
transmission  on  the  same  mm  wave  band  could  potentially  be 
exploited  using  channel  reciprocity.  For  the  east-west  street 
shown,  each  base  station  has  two  “faces,”  facing  east  and 
west,  respectively.  Each  face  can  have  multiple  antenna  arrays, 
but  for  simplicity,  we  consider  a  single  antenna  array  for 
each  face,  used  for  both  compressive  beaconing  and  downlink 
data  communication.  Mobile  stations  are  modeled  as  either 
pedestrians  walking  along  sidewalks,  or  cars  moving  along 
the  street. 

In  our  simulations,  we  model  K  =  4  dominant  paths  from 
base  station  to  mobile  in  our  simulations:  the  line  of  sight 
(LoS),  and  the  single  bounce  reflections  from  the  ground 
or  the  side  walls.  Some  of  these  paths  may  be  blocked  by 
obstacles  (diffraction  around  obstacles  is  limited  for  small 
carrier  wavelengths).  We  ignore  multiple  bounces,  since  such 
paths  get  attenuated  significantly,  especially  because  each 
bounce  sees  a  smaller  angle  of  incidence  than  a  typical  single 
bounce.  However,  our  compressive  estimation  algorithm  does 
not  use  the  preceding  assumptions  on  number  of  dominant 
paths  as  prior  information,  and  automatically  discovers  and 
tracks  paths. 

For  a  regular  d-spaced  square  N1D  x  Nid  antenna  array 
and  a  point  transmitter  in  the  far-field,  the  channel  seen  by 
the  array  is  a  mixture  of  2D  sinusoids,  each  corresponding  to 
a  propagation  path,  and  is  given  by 

K 

hm,n  =  y2gkej^’km+w^n),  gk  e  C,  1  <  m,n,  <  N1D, 

k=  1 

where  gk  is  the  propagation  gain  of  the  k- th  path,  uJk  = 
(cox,k^ uz,k)  are  the  spatial  frequencies  corresponding  to  the 
k- th  path  (w.l.o.g.  we  have  assumed  that  the  square  array 
is  placed  in  the  x-z  plane,  with  its  sides  aligned  to  the 
coordinate  axes)  and  dm?n  refers  the  channel  seen  by  the 
ra,  n-th  antenna  element,.  The  spatial  frequencies  of  the  k- 
th  path  are  given  by  ujx^  =  2ir(d/\)  sin  Ok  cos  fk  and  ujz^  = 
27r(d/A)  sin  Ok  sin  </>&,  where  d  denotes  the  array  spacing,  A  the 
carrier  wavelength  and  (Ok,<l>k)  the  inclination  and  azimuthal 
angles  of  the  k- th  path  relative  to  x  —  z  plane.  We  vectorize 
the  2D  sinusoid  | ^ej(uxm+uzn)  ^  o  <  m,  n  <  Nm  —  l]  and 
denote  the  resulting  N^D  long  vector  by  x(7Vi£>,  a;),  where 
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oo  =  (cox,coz)  is  the  frequency  of  the  2D  sinusoid.  Vectorizing 
1  <  ra,  n  <  Nid]  in  an  identical  manner  gives  us 

K 

k= 1 

Now,  consider  a  base  station  transmitter  with  a  regularly 
spaced  2D  array  of  size  Ntj id  x  Nt^D,  and  a  mobile  receiver 
with  a  regular  2D  antenna  array  of  size  Nr^u  x  7Vr;i£>.  Let 
H  denote  the  corresponding  N^1D  x  N^1D  channel  matrix. 
Denoting  by  the  ith  row  of  this  matrix,  hf  is  the  response  of 
the  receive  antenna  array  to  the  ith  transmit  antenna.  Denoting 
x  u;)  by  xt  (oo)  and  x  (Nr^D,  oo)  by  xr  (u;),  under  the 

far-held  assumption,  it  can  be  shown  that 

K 

H  =  X^5feXt  (wfc)xr  (uk)  >  5fe  e  C.  (l) 

fc=l 

Since  we  know  the  array  geometries  (in  this  case,  a  regu¬ 
larly  spaced  2D  array),  an  estimate  of  the  N^1D  x  N%1D 
MIMO  channel  matrix  H  can  be  efficiently  arrived  at  by 
estimating  the  spatial  frequencies  and  the  associated  gains: 
{(#£,  ook) ,  &  =  1, . . . ,  K}.  Such  a  parametric  approach  is 
far  more  efficient  that  direct  estimation  of  individual  entries 
of  H,  and  enables  us  to  drastically  reduce  the  number  of 
measurements  required. 

IV.  Compressive  channel  estimation 

We  now  describe  our  compressive  approach  for  spatial 
channel  estimation,  which  consists  of  a  channel  sounding 
strategy  and  an  estimation  algorithm  which  allows  a  base 
station  to  estimate  the  propagation  gains  {\gk\}  and  the  spatial 
frequencies  {ook}  in  parallel  for  all  mobiles  in  the  picocell. 

A.  Channel  sounding 

The  basestation  sounds  the  channel  using  M  compressive 
beacons.  Each  beacon  is  a  known  signal  sent  using  a  different 
set  of  transmit  weights.  The  weights  are  chosen  uniformly  and 
independently  at  random  from  a  small  set  of  coarse  phase 
shifts  (for  e.g.  from  the  set  {=bl,=bj},  where  j  =  y/^1). 
We  may  therefore  view  each  beacon  as  being  transmitted 
from  a  different  “virtual  antenna”  with  a  quasi-omnidirectional 
pattern.  Each  of  the  M  transmit  beacons  are  repeated  L  times 
by  the  basestation  (see  Figure  2).  For  each  of  these  M  transmit 
beacons,  a  mobile  receiver  employs  L  “virtual  antennas”  to 
measure  the  channel  response,  using  receive  weights  chosen 
uniformly  at  random  from  {±1 ,  ± j } .  Let  y(m,n)  denote 
the  response  at  the  (ra,  n)th  receive  element  due  to  a  given 
transmit  beacon.  Letting  b(m,n,l)  G  {=bl,=bj}  denote  the 
weight  for  (ra,  n)th  receive  element  for  the  Ith  virtual  receive 
antenna,  the  response  seen  by  the  Ith  virtual  receive  antenna 
is  given  by 

'(0  =  E  &(ra,  n,  Z)  x  y(m,n),  1  <  l  <  L.  (2) 

l<m,n<  Nr,\D 

These  measurements  are  used  to  construct  the  M  x  L  Multiple 
Input  Multiple  Output  (MIMO)  “virtual  channel”  matrix  V 
between  the  virtual  transmit  and  receive  antennas.  Note  that  we 


do  not  require  that  the  base  station  know  the  receive  weights 
used  by  the  mobile,  or  that  the  mobile  know  the  transmit 
weights  used  by  the  base  station. 

Denoting  the  vectorized  version  of  weights  of  the  i-th  virtual 
transmit  antenna  by  a i  (a  vector  of  shape  N^1D  x  1)  and  that 
of  the  j-th  virtual  receive  antenna  by  b j  (N^1D  x  1),  the 
i,j- th  element  of  V  (the  channel  between  the  th  virtual 
transmit-receive  pair)  is  given  by  Vij  =  af  Hb j.  Letting  A  = 
[ai  ...  a m]T  and  B  =  [bi  ...  bz,]T,  it  is  easy  to  see  that 

V  =  ABBt .  Using  (1),  we  have  that 

K 

V  =  ^9k  (Axt  (u>l))  (Bxr  (u>£))T .  (3) 

k= 1 

The  channel  measurements  are  perturbed  by  i.i.d  measurement 
noise,  and  are  given  by 

Ui,j  =  H-  ^i,ji  ~  CjV(0,  cr  ), 

where  Pe  is  the  per-element  transmit  power.  Letting  Y  and  Z 
denote  M  x  L  matrices  with  their  i,  j-th  entries  given  by  yij 
and  Zi  j  respectively,  the  “measured  virtual  channel”  is  given 
by 

_ K 

Y  =  x/Av+z  =  (ixf  (c4))  (Bxr  (c4)f+Z. 


B.  Feedback  strategies 

Our  goal  is  to  track  the  mm-wave  spatial  channel  as 
seen  from  the  basestation,  described  by  the  parameters  P  = 
{(\gk  \  7  ,  k  =  1, . . . ,  K}.  To  this  end,  every  mobile  in  the 

picocell  needs  to  feed  back  a  portion  of  the  measured  virtual 
channel  Y  to  the  basestation.  From  (4),  we  see  that  the 
information  regarding  the  spatial  channel  as  seen  from  the 
basestation,  given  by  P,  is  available  in  the  column  space  of 
Y.  Building  on  this  observation,  we  propose  two  feedback 
strategies: 

(i)  The  receiver  feeds  back  the  entire  matrix  Y. 

(ii)  A  more  elaborate  strategy  involves  feeding  back  Q  <  L 
strongest  left  singular  vectors  of  Y  scaled  by  their  corre¬ 
sponding  singular  values,  i.e.,  if  Y  =  Y^t= i  (JiuiviI  with 
or  >  cr 2  >  •  •  •  >  ctl  >  0,  the  receivers  feed  back  D  = 
[or ui  *  *  *  ctqUq].  This  strategy  identifies  the  Q -dimensional 
subspace  of  the  column  space  of  Y  with  maximum  energy. 

V.  Estimation  Algorithm 

We  now  present  an  algorithm  to  estimate  the  parame¬ 
ters  {(\gk  \ ,  ook) ,  k  =  1, . . . ,  K}  characterizing  the  mm-wave 
channel  as  seen  from  the  basestation.  The  same  algorithm 
applies  for  both  forms  of  feedback  discussed  in  Section  IV-B: 
the  entire  measured  virtual  MIMO  matrix  Y  or  the  dominant 
weighted  left  singular  vectors  D. 

The  Ith  column  of  Y  is  given  by 

K 

y  1  =  ^2  hk,iAyit  (Wfe)  +zh  1  =  1,  ...,L  (5) 

k= 1 
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where  z i  ~  CM  (0,  cf21m )  denotes  the  l- th  column  of  Z 
and  hk,i  =  xr  (u;£).  We  assume  the  weight  se¬ 

quence  {b^}  used  to  construct  the  receive  virtual  antennas  at 
the  receive  antenna  array  is  not  available  at  the  transmitter, 
and  hence  cannot  jointly  estimate  uk  and  uk.  However, 
since  {b^,  l  =  1 were  picked  in  an  i.i.d  manner, 
we  have  that  {ft&,z,  l  =  1, . . .  ,L}  are  i.i.d  realizations  of  a 
random  variable  with  E| hkjf  =  PeE\gkbrf  xr  (u>k)\  = 

PeN2 1D  \gk\\  This  allows  us  to  estimate  \gk\2  as  follows: 

Pe\gk\2  =  (l/(LN*1D))ir 

1  =  1 

From  here  on,  in  Section  V,  we  use  the  notation  uk  to  refer  to 
Mk  and  x(cj/e)  to  refer  to  Axt(uk).  Thus,  the  measurements 
can  be  written  as 

K 

y i  =  ^2 hk’lX +Zh  l  =  (6) 

fc= i 

We  now  provide  an  algorithm  to  estimate  ,  {ft/^}}. 

A.  Single  path 

We  first  present  an  algorithm  for  estimating  a  single  path 
K  =  1,  which  forms  the  basis  for  our  sequential  estimation 
algorithm  for  K  >  1.  Omitting  the  path  index  k  in  (6),  we 
have 

y i  =  hi*(u>)  +ZJ,  l  =  1,...,L. 

Given  that  {z^}  are  independent  realizations  of  CM  (0,  ct2Im)? 
the  maximum  likelihood  (ML)  estimates  of  a;,  {hi}  are  given 
by: 

L 

uAhi]  =  argmin  V"  ||yz  -  ftjx(u>)||2  .  (7) 

1  J  fMi 


(This  residue  is  used  for  sequential  detection  for  K  >  1,  as 
discussed  shortly.) 

Refinement  phase:  Our  estimate  from  the  detection  phase  is 
restricted  to  the  discrete  set  <f>  and  consequently  we  do  not 
expect  2j  to  be  equal  to  the  ML  estimate  given  by  (9)  (where 
the  maximization  is  over  [— 2nd/ A,  2nd/\\2  with  d  being  the 
spacing  between  transmitter  antennas).  However,  if  we  make 
the  grid  fine  enough,  the  best  estimate  of  co  in  <f>  is  expected 
to  be  close  enough  to  the  optimal  solution  to  allow  refinement 
via  local  optimization.  In  order  to  do  this,  we  first  fix  the 
gain  estimates  {hi}  and  refine  only  the  estimate  of  the  spatial 
frequency  co  by  seeking  the  minimizer  of  the  ML  cost  function 

L  2 

C(u>)  =  ^2  |yj  -  hix(oj) 

1  =  1 

in  the  neighborhood  of  the  current  estimate  Co  using  the 
Newton  method.  This  involves  evaluating  the  gradient  vector 
Q  (co)  and  the  Hessian  matrix  R(io)  of  C(u>)  at  the  current 
estimate  Co.  The  corresponding  expressions  are  given  by: 


Gi(w)  — 

nn  M  = 


duj 

d2c(u>) 

dujjdujn 


dC{-“)  =-2]T5ft 

1  =  1 

=  -2^ 5ft |  (viM 


l  dx(u)\\ 


dhcjuj) 

dujidujj 


hi 


2  /  dx(cj)  dx(cv) 
\  duJi  ’  dujj 


where  u>  =  {r^}-s  are  the  residual  measurements 

given  by  (10).  The  Newton  update  for  2)  is 


<-  d>  —  T-L  1  (o>)  Q  (2j)  .  (11) 

We  follow  this  up  by  updating  our  estimates  j  hi  j  by  plugging 
the  new  value  of  2)  in  (8),  i.e., 


For  any  u>,  the  optimal  hi- s  are  given  by  least-squares  esti¬ 
mates: 

h*i{u)  =  (x(a>),yi)/||x(u>)||2  (8) 

where  (x,  y)  denotes  xHy.  Plugging  into  (7),  the  ML  estimate 
of  (jj  is  given  by: 

1  L 

<h  =  arg max  2  ^  |(x (u>) , y;)|2  (9) 

"  iixHil  ixi 

and  the  ML  estimate  of  hi  is  given  by  ft,*  (d>).  We  employ  a 
two-step  algorithm  to  arrive  the  ML  estimates:  a  “detection” 
phase  followed  by  a  “refinement”  phase. 

Detection  phase:  Using  M  two-dimensional  FFT  computa¬ 
tions,  we  precompute  x(u>)  for  frequencies  of  the  form  uj  G 
$  =  {(2m/T,2irj/T),  0<i,j<T-l},  T  =  RN1D, t, 
where  R  is  the  oversampling  factor.  We  pick  the  frequency 
u  from  which  maximizes  (9).  The  corresponding  gains  are 
given  by  hi  =  ft*  (d>).  We  remove  the  contribution  of  the 
newly  detected  sinusoid  from  the  measured  channel  response 
and  this  residual  measurement  is  given  by 

(10) 


hi  <r-  hi  (u)  =  (x  (w),  y i)  j ||x  (u>) ||2  (12) 

and  modifying  the  residues  ({r /})  accordingly  using  (10).  The 
algorithm  alternates  between  the  updates  in  (11)  and  (12)  for 
a  few  iterations. 

B.  Multiple  paths 

We  now  build  on  the  preceding  single  path  algorithm 
for  the  general  setting  of  K  >  1.  Suppose  that  our  cur¬ 
rent  estimate  of  the  sinusoids/paths  is  given  by  Vk  = 
|  (^2>k,  {ftfc,z})  ,  ft  =  1, . . . ,  kJ.  The  residual  measurements 
corresponding  to  a  set  of  estimated  parameters  V  is  given  by: 

M'P)  =  y i-  T,uk,{hktl}ev  hk,i x  (wfc)  • 

Detect  a  new  path:  Assuming  that  the  measurements  y i  are 
given  by  the  current  residue  vi(Vq)  (corresponding  to  the 
q  detected  paths),  we  use  the  single  path  algorithm  in  Sec¬ 
tion  V-A  to  detect  and  refine  a  new  sinusoid  (2)q+ 1,  {ftg+i,z}). 
Let  Vq+i  denote  the  new  set  of  estimated  parameters  Vq  U 
{(^qr+lj  {hq+l,l})}- 

Refine  existing  paths:  Once  we  add  this  new  path,  we  refine 
the  parameters  of  all  q  +  1  sinusoids  in  Vq+i  one  by  one. 


ri=yi-  hix  (u>) . 
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Consider  the  parameters  (d)*.,  {hk,i})  of  the  k- th  sinusoid. 
We  use  the  refinement  algorithm  in  Section  V-A  to  refine 
(d>fc,  {hk,i})  by  assuming  that  the  measurements  yi  are  given 
by  the  residual  measurements  after  excluding  the  sinusoid  of 
interest,  i.e,  vi{Tq+ 1  \  {(d>&,  {hkj})}).  Sinusoids  are  refined 
in  a  round  robin  manner,  and  the  process  is  repeated  for  a  few 
rounds:  1  2  -A  •  •  •  -A  (q  +  1)  -A  1  -A  •  •  •  -A  (g  +  1). 

Stopping  criterion:  The  algorithm  terminates  when  the 
residues  {v/(Tg)}  after  refinement  satisfy: 


Deleting  weak  paths:  Paths  estimated  in  prior  rounds  may  not 
be  viable  at  the  current  time  instant  (e.g,  because  of  blockage). 
Therefore,  we  need  means  to  remove  such  stale  paths.  We  use 
the  stopping  criterion  (13)  as  a  means  to  delete  weak  paths. 
If  deleting  the  path  under  question  (say  k)  and  optimizing 
other  parameters  7\0pt  Refine  (v  \  (d>&,  {/ifc,z}))  using 
our  refinement  algorithm  results  in  residual  measurements 
{viiVk, opt)}  that  can  be  explained  by  noise  (i.e.,  satisfies  (13)), 
we  delete  the  path  permanently.  Otherwise  we  keep  the  path. 


max 

u>£DFT 


i 

llxMII2 


EKX  M,v*(A)>  I2 


1=1 


<  T, 


(13) 


with  the  threshold  r  given  by 

aV1  (i,r(L)(l-Pfa)1/JV?-.‘)  where  7"1(S',y) 
is  the  inverse  of  incomplete  gamma  function 
7  (S,x)  with  respect  to  integral  limit  x  and 
DFT  =  {(2m/N1D^j/N1D:t)  :  0  <  i,  j  <  N1D,t  -  1}. 
This  choice  of  r  corresponds  to  a  Constant  False  Alarm 
Rate  (CFAR)  test  (the  false  alarm  rate  being  Pfa)  for  whether 
the  residual  measurements  {vi(Vq)}  can  be  explained  by 
another  path.  In  arriving  at  this  expression  for  r  we  make  two 
assumptions:  (i)  channel  sounding  is  not  compressive:  i.e, 
A  =  1^2^  ^  and  (ii)  new  paths  correspond  to  specific  beam 
orientations  given  by  u?  G  DFT.  However,  our  simulation 
results  show  that,  this  is  a  good  approximation  for  our  case 
with  compressive  measurements  and  continuous  values  of 
beam  directions.  We  refer  the  reader  to  Appendix  A  for 
details  on  the  stopping  criterion. 


C.  Tracking 

We  sound  the  channel  often  enough  so  that  between  any  two 
successive  channel  estimation  cycles,  the  geometry  of  the  mm- 
wave  channel,  given  by  the  spatial  frequencies  {u>k}  of  the 
paths,  does  not  change  “significantly,”  even  if  the  path  gains 
{gk}  do.  This  ensures  that  angle  of  departure  estimates  from 
the  prior  sounding  round  do  not  become  stale  over  the  course 
of  the  communication  phase  during  which  they  are  needed 
for  beamforming  purposes.  For  example,  if  we  do  not  wish  to 
tolerate  a  beamforming  loss  of  3dB  or  more,  then  our  estimate 
from  the  previous  round  should  be  close  enough  to  the 
current  so  that 

l(x  (u>),x(u)))\2  j ||x (u>)||2  >  0.5 

over  the  entire  communication  phase.  This  condition  is  met  if 
||o?  —  a>||oo<0.5x  (2ir/Nt,iD)-  Therefore,  the  estimates  of 
spatial  frequencies  from  the  previous  sounding  round  are  good 
approximations  of  their  current  true  value  (within  a  DFT  spac¬ 
ing  of  N2t7[D  )•  We  exploit  this  by  using  k  =  1, . .  „ ,  K} 
from  the  prior  round  to  initialize  our  algorithm  (as  opposed  to 
using  the  empty  set  {}).  We  do  this  by  constructing  the  matrix 
X  =  [x  (u?i)  ...  x  (&k)]  and  setting  hk,i  to  be  the  (fc,  Z)-th 
entries  of  (! XHX)~1XHY ,  where  Y  =  [yi  ...  yL],  We 
refine  all  parameters  in  Vk  =  {(<£&,  {/&&,/})>  k  =  1, . . . ,  K} 
using  the  refinement  algorithm  in  Section  V-B  before  proceed¬ 
ing  to  seek  for  new  paths  using  the  algorithm  in  Section  V-B. 


VI.  Protocol  Parameter  Choices 

In  this  section,  we  give  a  principled  approach  to  choosing 
parameters  of  the  compressive  channel  estimation  protocol, 
namely  the  number  of  unique  transmit  beacons  M,  the  number 
of  receive  measurement  weights  L  and  the  minimum  effective 
Signal  to  Noise  Ratio  (SNR)  needed  for  channel  estimation, 
which  we  use  to  choose  the  sounding  bandwidth  Ws .  We  then 
turn  to  the  question  of  how  frequently  the  channel  has  to  be 
sounded.  In  Section  VIII,  we  take  two  scenarios  and  apply  this 
recipe  to  arrive  at  system  level  parameters  for  the  protocol. 


A.  Number  of  compressive  transmit  beacons 

Classical  compressive  sensing  aims  to  reconstruct  signals 
which  are  sparse  in  a  discrete  basis,  based  on  a  small  number 
of  projections.  In  order  for  reconstruction  to  be  successful, 
these  projections  must  preserve  the  underlying  geometry  [30]. 
In  [27],  these  ideas  are  extended  to  compressive  estimation, 
in  which  a  small  number  of  projections  are  used  to  estimate 
continuous- valued  parameters. 

We  now  translate  these  concepts  to  our  present  context. 
Our  goal  is  to  estimate  the  spatial  frequencies  {c<4}  from 
measurements  of  the  form 


K 

yi  =  E  hk,lAxt  (Wfe)  +  z /,  l  =  1, «  .  .  ,  L. 

k- d 


The  algorithm  in  Section  V  aims  to  estimate  parameters 
{hk,i,  Wk}  by  minimizing  the  ML  cost  function: 


E1U  y* _  Ek= i  hk,iAxt  (wfc) 


=  Ef=i  A  x  Ef=i  (hk,ix-t  04)  -  hk,i*t  (&£))  + 


Z I 


refer  to  our  estimates  of  {hk,i,  w|}-  If  the 


where 

compressive  measurement  matrix  A  ensures  that 


A  x 


Ef=i  (V*xt  04)  -  4mx*  04)) 


(14) 


M 


Ef=i  (hk,i*t  04)  -  hk,(x-t  04))  )  V hk,i ,  hk,u 


for  relevant  ({c^/e},  {u^}) -pairs,  the  cost  structure  of  the 
estimation  problem  is  roughly  preserved.  Therefore,  estimation 
using  compressive  measurements  is  similar  to  estimation  with 
all  Vid  measurements  (except  for  a  reduction  in  effective 
SNR,  given  by  M/N^1D)  [27]. 

We  now  state  an  isometry  property  relevant  for  our  purpose. 
For  some  fixed  e,  the  matrix  <f>  G  Cmxp  is  said  to  enjoy  the 
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s-isometry  property  for  the  basis  B  (of  size  p  x  n)  if  there 
exists  a  constant  C  >  0  such  that 

C(l-e)<  ||$Bu||2/||i?u||2  <  C(1  +  e), 

for  all  s-sparse  u  in  Cn.  It  can  be  shown  using  the  celebrated 
Johnson-Lindenstrauss  Lemma  (JL  Lemma)  [31]  (with  argu¬ 
ments  similar  to  those  in  [30])  that  if  m  —  0(se~2  logn), 
a  randomly  picked  <f>  (with  elements  drawn  from  proper 
distributions,  e.g.,  Uniform{±l,  ±j}  [27],  [31])  satisfies  this 
isometry  property  with  high  probability. 

In  order  to  provide  a  concrete  definition  of  geometry 
preservation  in  our  setting,  we  consider  a  discretized  version 
of  the  problem,  when  spatial  frequencies  are  restricted  to 
an  oversampled  DFT  grid  G  of  size  R  =  0(N21D).  The 
condition  (14)  reduces  to  a  2K -isometry  property  of  the 
measurement  matrix  A  relative  to  the  basis  X,  where  X  is  the 
N2 1D  x  R  matrix  with  its  columns  given  by  {xt  (a;)  :  u  G  G}. 
We  can  now  apply  the  preceding  isometry  property  to  conclude 
that,  when  the  number  of  unique  transmitter  beacons  scales 
as  M  =  O  [Ke~2  logi?)  =  O  [Ke~2  log  NtyiD),  then  the 
2K -pairwise  isometry  criterion  w.r.t  the  basis  X  is  met  by 
the  randomly  picked  sounding  matrix  A  (it  can  be  shown 
that  C  =  M  for  our  choice  of  scale),  thereby  ensuring  that 
the  geometry  of  the  spatial  frequency  estimation  problem  is 
preserved. 

While  the  preceding  calculations  give  order  estimates  for 
the  number  of  measurements  M  required,  in  order  to  pro¬ 
vide  numerical  values  for  our  protocol  design,  we  employ 
simulations.  We  consider  the  example  of  the  32  x  32  trans¬ 
mitter  array  and  plot  the  maximum  and  minimum  values  of 
(1/M)  ||  AXull2^/ ||Xu||2  from  5  x  106  random  realizations 
of  a  2 K  =  8-sparse  u  (which  represent  different  realizations 
of  K  =  4  paths  and  their  corresponding  potential  estimates) 
in  Figure  3  using  the  64-times  oversampled  DFT  grid  as  the 
choice  of  basis  X.  We  see  that  this  ratio  is  within  [—5,3]  dB 
when  M  >  30.  This  illustrates  that  for  estimating  K  =  4  paths 
using  a  32  x  32  array,  measuring  the  response  corresponding  to 
M  =  30  random  beacons  approximates  the  effect  of  measuring 
all  32  x  32  =  1024  antenna  elements  individually. 

B.  Number  of  compressive  receive  measurements 

While  we  do  not  track  {u;£,  k  =  1  the  spatial 

frequencies  at  the  receiver,  we  need  to  ensure  that  the  set  of 
measurements  made  at  the  receiver  have  sufficient  informa¬ 
tion  to  estimate  transmitter  spatial  frequencies.  Suppose  that 
||Bxr(wj[)||  «  0,  it  follows  from  hktl  =  gk\[Ve bfxr(a>£) 
that  all  L  measurements  {y /,  1  <  /  <  L}  will  have  very  small 
contributions  from  the  k- th  path,  i.e.,  \hk,i\  ~  0, 1  <  l  <  L. 
To  see  this  observe  that: 

=  pe  \9k\2y2MXr  =  Pe  \9k\2  \\Bxr  (<*>£)  || 

1  =  1  1  =  1 

(15) 

Again,  restricting  the  receive  spatial  frequencies  to  an  over¬ 
sampled  DFT  grid  G  of  size  R  =  0(N21D),  it  can  be  shown 
that  for  L  =  O  (log 72)  =  O  (log  JVr,’i£>),  ||Bxr(cj)||2  « 


Fig.  3:  Maximum  and  minimum  values  of 
||AXu||2/(M||Xu||2)  for  different  values  of  M,  the 
number  of  transmitter  beacons,  across  5  x  106  random 
realizations  of  8-sparse  u.  The  basis  X  corresponds  to  the 
responses  for  a  32  x  32  array  evaluated  uniformly  over  a 
R  =  64  x  322 -sized  grid 


Fig.  4:  Maximum  SNR  degradation 

min^  ||ILxr(u;)||2^(Z/  ||xr(u>)||2)  for  the  most  favorable 
realization  (from  104  runs)  of  an  L  x  N21D  matrix  B  with 

Nr, ID  =  4 

L  ||xr  (cj)||2  =  LN21D  w.h.p.  This  ensures  that 

L 

J2\hk,i\2  «  PeLN?tlD\gk\2  W.h.p, 

1  =  1 

thereby  capturing  power  along  the  k- th  path.  We  perform 
computations  for  the  maximum  power  lost  across  spatial 
frequencies  when  using  a  4  x  4  array  and  plot  the  results  in 
Figure  4.  This  shows  that  around  5  carefully  chosen  projec¬ 
tions  (we  pick  the  best  measurement  matrix  from  104  random 
instances)  suffice  to  ensure  that  SNR  degradation  (relative  to 
the  nominal  value  of  L)  is  no  greater  than  3dB  for  a  4  x  4 
receive  array. 

C.  SNR  for  successful  estimation 

The  preceding  criteria  delineate  the  regime  in  which  the 
geometry  of  the  estimation  problem  is  preserved  approxi¬ 
mately.  We  now  turn  to  another  factor  which  affects  estimation 
performance,  namely  the  SNR.  Consider  measurements  of  the 
form 

Vm,n  =  eP(-Uim+U2n+<l,'>  +Zm,n,  0  <  TO,  n  <  Nt,  1D~1,  (16) 


Fig.  5:  ZZB  threshold  SNR  SNRth  for  estimating  the  frequency 
of  a  Ntjio  x  ^t,iD  sinusoid  as  a  function  of  Nt^D 


where  zm?n  are  i.i.d.  CJ\f(0,  cr2)  and  spatial  frequencies  cci,  CC2 
and  phase  <j)  are  parameters  to  be  estimated.  The  Cramer 
Rao  Bound[32]  (CRB)  for  estimating  u)\  from  measurements 
(16)  is  given  by  C(a2)  =  6/(SNR  (N21D  —  l)) ,  where 

SNR  =  ||xt(07)||  2/a2  =  Nl1D/  a2  (same  expression  holds 
for  cj2)-  Assuming  an  uniform  prior  over  [0,27 r)3  for  the 
parameters  (cci,  co>2, </>),  the  Ziv  Zakai  Bound  (ZZB)  with 
periodic  distortion [3 3]  for  estimating  00 1  evaluates  to: 


Z(SNR)  =  J  Q  ^SNR  ^1  - 


sin  (Nttir>h/2) 


Nt,iD  sin  (h/2) 


hdh. 


An  indicator  of  the  SNR  needed  for  successful  estimation 
is  the  convergence  of  the  ZZB  to  the  CRB  [27].  We  use 
the  SNR  beyond  which  the  ZZB  is  within  O.ldB  of  the 
CRB  as  a  measure  of  this  convergence.  We  plot  this  ZZB 
threshold  SNR  for  different  values  of  NtjiD  in  Figure  5.  e.g, 
SNRth  =  16.04dB  for  an  8  x  8  array  and  SNRth  =  16.13dB 
for  a  32  x  32  array. 

The  total  energy  Etot  corresponding  to  the  k- th  path  col¬ 
lected  across  the  ML  measurements  {yj,  1  <  l  <  L}  is  given 
by: 


L 

Etot  =  ||Axt  (c4)||2  X  \hkti\2  . 

1  =  1 

Using  (15)  in  the  above,  we  have  that 

E,ot  =  || Xxt  (wjfe)  ||2  ||-Bxr  {urk)\\2  Pe  \gk\2 
~  MLN?tlDN*1DPe  \gk\ 2 
=  MLPN?ad  \gk\2 , 

where  P  =  N21DPe  is  the  total  transmit  power  supplied  to  the 
Nt,iD  x  ? id  antenna  array.  The  above  approximation  holds 
when  M  and  L  satisfy  the  preceding  geometry  preservation 
criteria  in  Sections  VI-A  and  VI-B  respectively.  The  effective 
SNR  of  the  i-th  path  is  given  by  SNReff  =  Etot/cr2.  It  is 
important  to  note  that  the  per-measurement  noise  variance  cr2 
is  given  by  cr2  =  N21Dcr2,  where  cr2  is  the  noise  variance 
per  antenna  element.  Assuming  no  interference  (which  we 
account  for  in  Section  VII-D),  cr2  =  NoWs  with  Ws  denoting 
the  sounding  bandwidth  and  No  the  thermal  noise  floor.  The 
reason  for  the  scale  factor  N2 1D  in  the  expression  for  cr2  is  the 
following:  Our  measurement  process  consists  of  multiplying 


the  received  signal  at  each  antenna  (of  which  there  are  N21D) 
by  phasors  6(m,n,/)  G  {±l,=bj}  and  adding  the  resultant 
signal  (as  per  (2)).  Since  thermal  noise  seen  by  the  N21D 
isolated  receive  antennas  are  independent  random  variables, 
we  have  that  a2  =  N21D  x  (N0WS).  Therefore,  the  effective 
SNR  of  the  i-th  sinusoid  is  given  by: 

SNReff  =  MLP  \gk\2 /n0Ws  . 

This  must  exceed  the  ZZB  threshold  SNRth  for  successful 
estimation.  Noting  that  ML /Ws  is  the  time  taken  for  channel 
sounding,  the  ZZB  threshold  SNRth  gives  us  the  means  to 
evaluate  the  minimum  overhead  in  time  to  estimate  the  channel 
for  a  given  path  gain  \gk\2'- 

Time  taken  =  ML/WS  >  SNRth  No^P  \gu\2  (17) 

The  size  of  the  picocell  gives  us  a  lower  bound  on  \g^  |2  and  we 
later  use  this  to  guide  us  in  choosing  the  sounding  bandwidth 
Ws  using  (17). 

D.  Sounding  rate 

We  round  off  the  discussion  on  choice  of  protocol  pa¬ 
rameters  by  giving  a  rule  of  thumb  for  the  rate/frequency 
fs  at  which  the  spatial  channel  {cok}  needs  to  be  reesti¬ 
mated.  We  use  the  estimated  spatial  frequency  Co  for  beam¬ 
forming  purposes  in  the  time  period  between  two  channel 
sounding  rounds  (communication  phase  sandwiched  between 
consecutive  sounding  phases;  see  Figure  7).  Following  the 
discussion  in  Section  V-C,  we  have  that  if  \\oo(t)  —  Co  Hoc  < 
7r/iVt,iz>  throughout  the  communication  phase,  where  00 (t) 
denotes  the  true  spatial  frequency  and  Co  the  estimate  from 
the  prior  sounding  round,  then  the  loss  in  SNR,  given  by 
1 1 (uo (f ) )  1 1 2 y/ (xt  (cD),xt  (lo (t))) ,  is  smaller  than  3dB.  If  we 
assume  that  the  closest  user  to  the  basestation  array  is  at  a 
distance  R  meters  and  that  the  maximum  speed  of  a  user 
in  the  picocell  is  given  by  ?;max  meters  per  second,  then 
the  maximum  change  (in  terms  of  the  ^oo-norm)  in  spatial 
frequency  Acc  between  consecutive  sounding  phases,  spaced 
1  jfs  apart,  is  given  by  27r<i?;max//£PA.  The  worst-case 
geometry  which  achieves  this  bound  is  when  the  user  is  at 
a  distance  R  along  the  bore-sight  of  the  array  and  heading  in 
a  direction  aligned  with  the  one  of  the  array  axes.  For  this 
worst-case  geometry  (plotted  in  Figure  6),  we  have  that: 

Ac 0  <  ( 2nd/  A )  sin  A 9  ~  ^max . 

JB^R 

Assuming  that  the  estimate  Co  from  the  previous  sounding 
phase  is  accurate,  if  we  ensure  that  2ndvrn^/ fsR^  < 
n/Nt,iD,  we  have  that  the  beamforming  losses  in  the  inter¬ 
vening  period  are  smaller  than  3dB.  This  tells  us  that  channel 
needs  to  be  sounded  often  enough  so  that 

fs  >  2dumax7Vt5ii)/PA  .  (18) 

In  the  following  discussions,  we  use  the  preceding  in  con¬ 
junction  with  (17)  to  determine  the  overhead  incurred  in 
estimating  the  channel  using  the  compressive  architecture 
proposed  herein. 
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Fig.  6:  Geometry  corresponding  to  maximum  change  in  ujz : 
The  user  moves  along  the  2  axis  at  a  speed  of  umax  in  the 
time  interval  1  //#  between  two  consecutive  channel  sounding 
rounds 


VII.  System  design 

We  now  discuss  some  key  aspects  of  downlink  system 
design  related  to  our  compressive  architecture.  We  start  by 
choosing  basestation  transmit  power  based  on  rules  set  by 
regulatory  authorities,  and  then  filling  in  the  other  details 
of  the  protocol  according  to  the  prescriptions  laid  out  in 
Section  VI.  Fixing  the  mobile  array  to  be  4  x  4  (lVr,i£>  =  4), 
we  consider  two  different  choices  for  the  base  station  array 
size:  8x8  (Nt,iD  =  8)  and  32  x  32  (Nt^D  =  32).  All 
arrays  are  d  =  A/2-spaced.  The  total  available  bandwidth  for 
communication  and  sounding  is  2GHz. 

A.  Transmit  power 

We  fix  the  effective  isotropically  radiated  power  (EIRP) 
to  40dBm,  consistent  with  Federal  Communications  Com¬ 
mission  (FCC)  regulations  for  60  GHz  unlicensed  transmis¬ 
sion.  Accounting  for  transmit  beamforming  gain  using  an 
Nt,iD  x  Nt,iD  array,  the  total  transmit  power 

P  =  40  -  20  log  Nt,  1D  dBm  (19) 

which  evaluates  to  22dBm  and  lOdBm  for  N^id  =  8  and 
Nt,iD  =  32  respectively.  Assuming  that  this  power  is  evenly 
split  among  the  N^1D  transmit  elements,  the  power  per  trans¬ 
mit  element  is  given  by  Pe  =  P  —  20  log  N^id  dBm  =  40  — 
201og7Vt?i£>  dBm,  which  evaluates  to  4dBm  and  —  20dBm, 
respectively.  Assuming  that  we  design  each  element  to  operate 
at  a  fixed  power,  this  is  also  the  power  per  element  used  in 
the  beaconing  phase,  even  though  the  latter  does  not  get  the 
benefit  of  transmit  beamforming. 


B.  Communication  range 

In  order  to  ensure  that  the  SNR  for  compressive  estimation 
is  adequate  over  a  picocell,  we  first  determine  the  picocell  size 
using  a  nominal  communication  link  budget,  and  then  calculate 
the  overhead  required  for  successful  estimation  at  that  range. 
Standard  link  budget  calculations,  assuming  oxygen  absorption 
of  16  dB/km,  an  EIRP  of  40dBm  and  a  4  x  4  receive  array 
providing  directivity  gains  of  12  dBi,  can  be  used  to  show 
that  we  can  attain  a  per-symbol  SNR  of  6dB  at  a  link  margin 
of  lOdB  for  a  symbol  rate  of  2 GHz  at  a  range  of  200m.  For 
omnidirectional  free  space  propagation,  the  power  gain  in  dB 
as  a  function  of  range  r  is  given  by 

GdB{r )  =  -\xr  +  20 log10  A-  (dB) 


where  p  =  0.016dB/m  to  account  for  oxygen  absorption. 
Note  that  //  can  be  increased  in  order  to  account  for  rain. 
However,  since  our  purpose  is  to  ensure  that  channel  estima¬ 
tion  is  successful  whenever  communication  is  successful,  the 
contribution  due  to  GdB (j)  cancels  out,  as  we  show  shortly. 
Thus,  while  the  particular  value  of  p  determines  picocell  size, 
we  shall  see  that  it  does  not  affect  the  overhead  for  channel 
estimation. 

The  SNR  per  symbol  is  given  by 

SNRc(dB)  =  EIRP  (dBm)  +  Gd#(r)  +  201og10  lVr,iD 

’  (20) 

10  log10  (No Wc)  Lrnargin^COmm) 

where  Lmargin(comm )  is  the  link  margin  (dB)  for  communi¬ 
cation.  Note  that  101og1Q  Ao  =  —174  +  NF  dBm  over  a 
bandwidth  of  1  Hz,  where  NF  denotes  the  receiver  noise 
figure  in  dB.  Plugging  in  Wc  =  2GHz,  7Vr,iD  =  4,  and 
NF  =  6dB,  we  obtain  a  per  symbol  SNR  of  7  dB  at  a  range 
of  r  =  100  meters. 


C.  Channel  sounding  protocol 

Our  channel  sounding  protocol  is  specified  by  four  param¬ 
eters:  (i)  bandwidth  used  by  each  basestation  when  sounding 
the  channel,  which  we  denote  by  Ws  (ii)  number  of  transmit 
beacons  (or  virtual  transmit  antennas)  M  (iii)  number  of 
receive  measurements  per  transmit  beacon  (or  virtual  receive 
antennas)  L  and  (iv)  sounding  rate  /#  which  determines 
how  often  the  channel  is  sounded.  The  parameters  M,  L 
and  Ws  together  determine  the  effective  sounding  SNR.  This 
must  exceed  the  ZZB  threshold  SNR  for  successful  channel 
estimation.  This  gives  rise  to  the  condition  in  17.  Imposing 
an  estimation  link  margin  Lrnargin(est)  (dB)  and  going  to  the 
dB  domain,  we  have 


10  log10  ( ML/WS )  >  SNRth  +  Lmargin(est)  (21) 
+  10  log10  -/Vo  —  P  —  GdB(r ) 


Adding  (21)  and  (20)  and  simplifying,  we  obtain 


101og10  {ML/WS)  >  SNRth  -  SNRc  +  Lmargin(est) 

L margin  (comm)  +  20  log10  Nt, id 
+  20  log10  NrAD  -  10  log10  Wc 


The  key  take-away  is  that  ML/WS  must  be  large  enough 
to  compensate  for  the  fact  that  we  do  not  have  the  benefit 
of  beamforming  during  the  sounding  phase.  Notice  that  the 
range  r  (i.e.,  the  dependence  on  picocell  size)  has  cancelled 
out.  Setting  Lmargin(est)  =  16dB  (we  use  a  higher  link 
margin  for  channel  sounding  to  account  for  power  losses  due 
to  randomness  of  A  and  £?),  we  obtain 


ML 

Time  taken  =  — —  > 

BA 


16.34  ps 
0.2669  ms 


Nt,iD  =  8 

JVt,  id  =  32. 


(22) 


We  choose  the  number  of  transmitter  beacons  for  the  8x8  and 
32  x  32  transmitter  arrays  based  on  the  geometry  preservation 
criterion  for  the  transmitter’s  spatial  channel  estimation  prob¬ 
lem  discussed  in  Section  VI-A.  We  use  M  —  24  for  N^id  =  8 
and  M  =  30  for  id  =  32  by  numerically  evaluating  the 
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Fig.  7:  Channel  sounding  and  communication  phases  of  the 
proposed  system 


worst-case  distortion  of  pairwise  distances  relevant  for  the 
channel  estimation  problem  (in  Figure  3  we  plot  the  worst- 
case  distortion  as  a  function  of  L  for  a  random  instance  of 
A  and  Nt^jj  =  32).  Using  the  receive  energy  preservation 
criterion  given  in  Section  VI-B,  we  choose  the  number  of 
receive  weights  for  the  4x4  receive  array  as  L  =  6.  Using 
these  values  for  M  and  L  in  (22),  we  obtain  that  the  channel 
sounding  bandwidth  must  satisfy 


WH  < 


8.8124  MHz 
674.34  KHz 


Nt,iD  =  8 

NtilD  =  32. 


Our  specification  of  the  channel  sounding  protocol  will  be 
complete  when  we  give  /#,  the  rate  at  which  we  sound  the 
channel  (see  Figure  7)  which  must  satisfy  (18).  Assuming  that 
the  closest  user  is  at  a  distance  of  R  m  20m  and  that  the 
maximum  speed  of  a  user  in  the  picocell  umax  is  45  miles 
per  hour  (20  m/s),  we  have  that:  /#  >  8  Hz  for  N^id  =  8 
and  fs  >  32  Hz  for  Ntj id  =  32.  Choosing  the  minimum 
value  for  /#,  we  have  that  the  overhead  for  our  channel 
sounding  protocol  is  MLfs/Ws  =  0.0131%  for  Nt:m  =  8 
and  0.8542%  for  NtilD  =  32. 


D.  Reuse  analysis  for  channel  sounding 

We  investigate  how  a  sequence  of  basestations  employed 
in  an  urban  canyon  environment  can  share  resources  when 
estimating  the  spatial  channel  to  users  in  their  respective 
cells.  The  envisioned  mm- wave  system  involves  alternating 
between  channel  estimation  and  communication  phases  as 
shown  in  Figure  7.  We  assume  that  channel  sounding  rounds 
across  basestations  are  aligned  in  time.  We  now  characterize 
how  the  2 GHz  spectrum  is  to  be  shared  in  space  so  as  to 
limit  the  effect  of  interference  from  neighboring  picocells  on 
channel  estimation  performance.  Such  interference  manage¬ 
ment  is  essential  in  the  sounding  phase;  unlike  the  highly 
directive  beams  used  in  the  communication  phase,  compres¬ 
sive  sounding  beacons  are  essentially  omnidirectional.  To  see 
this,  consider  the  average  transmit  power  along  any  direction 
u>.  This  is  given  by  Pe  \\Axt(u;)\\2 / L  «  Pe  ||xt(w)||2  = 
PeN^1D  =  P,  the  total  transmit  power.  The  approximation 
|| Ayit(<jj)\\2 1 L  «  ||xt(u?)||2  holds  when  the  number  of 
beacons  L  is  large  enough.  Therefore,  the  average  energy 
per-measurement  received  by  an  antenna  at  a  distance  r 
from  a  transmitter  sending  compressive  beacons  is  given  by 
PG(r),  where  G(r )  =  lOGdS(r)/10  =  X2  /  (16tt 2r2)  e~ur 
{y  =  (/i/10)  In  10)  is  the  omnidirectional  power  gain  at 
range  r.  We  assume  that  basestations  are  deployed  regularly 
as  shown  in  Figure  8  and  that  the  inter-basestation  separation 
(along  the  street)  is  given  by  S.  Suppose  that  the  reuse  factor 


is  R  (i.e,  every  Rt h  basestation  uses  the  same  slice  of  the 
frequency  spectrum  to  estimate  downlink  spatial  channels). 
We  assume  that  for  narrow  urban  canyons,  the  distance  be¬ 
tween  a  user  and  all  interfering  basestations  (those  that  are 
allocated  the  same  sounding  BW)  are  well  approximated  by 
{kRfS,  k  G  Z  \  {0}}.  Thus,  the  interference  power  seen  by 
a  single  antenna  is  given  by 

oo  oo 

/  =  2  x  PG(kRfs)  =  8P  G(kRfS), 

k=l  paths  k=  1 

where  we  have  assumed  that  there  are  4  viable  paths  between 
the  interfering  basestation  and  user,  each  introducing  the  same 
amount  of  interference  as  the  LoS  path.  This  is  a  pessimistic 
assumption,  since  NLOS  paths  are  attenuated  by  larger  path 
lengths  and  reflection  losses.  Plugging  in  the  expression  for 
G(r),  we  have  that 

oo 

/  =  (PX2 /2tt2 Rjs2)  e~uRfSk /k2 

k=  1 

=  (PX2/2Tr2R2fS2)U2  (e~vRts) , 

where  L12  (z)  =  YLk=i  zk  / k2  is  the  dilogarithm  function.  The 
interference  seen  per  antenna  adds  to  thermal  noise  to  give  an 
effective  per-element  noise  level  of  a2  =  NolUs+7.  Assuming 
a  worst-case  geometry  for  the  user  of  interest  (distance  of 
S  from  the  basestation)  and  proceeding  as  in  Section  VI-C, 
we  see  that  effective  Signal  to  Interference  and  Noise  Ratio 
SINReff  is  given  by 

SINReff  =  MLPG{S)/t 72e  . 

This  can  be  rewritten  as 

1/ SINReff  =  l/SNReff  +  l/SIReff  , 

where  SNReff  =  MLPG(S) /N0WS  and  the  Signal 
to  Interference  Ratio  SIReff  =  MLPG(S) / 1  = 
MLR2e~^s  j 8Li2  ( e~^Rfs ) .  We  need  to  ensure  that 
SINReff  exceeds  the  ZZB  SNR  threshold  for  successful  es¬ 
timation.  We  choose  the  reuse  factor  Rf  so  that  we  are  in  the 
noise-limited  regime  by  setting 

l/SIReff  <  0.1  x  (1/SNRth)  «  -10  -  16  dB. 

Assuming  that  protocol  parameters  are  chosen  so  that  SNReff 
exceeds  SNRth,  we  can  ignore  interference  in  SINReff  calcu¬ 
lations  when 

SIReff  >  SNRth  +  10  «  26dB  for  Nt:1D  =  8,  32. 

In  Figure  9,  we  plot  achievable  effective  SIRs  as  a  function 
of  frequency  reuse  factor  Rf  for  two  example  systems  in 
Section  VII-C:  i.e,  8x8  and  32  x  32  arrays  with  total 
number  of  measurements  given  by  ML  =  24  x  6  and 
ML  =  30  x  6  respectively.  As  the  picocell  size  S  grows, 
exponential  attenuation  due  to  oxygen  absorption  (the  e~vS 
term  in  the  expression  for  SIReff)  helps  in  attenuating  in¬ 
terference  and  improving  SIR  for  same  reuse  factor  Rf.  To 
illustrate  this  we  plot  SIR  as  a  function  of  Rf  for  three  cell 
sizes  S  =  50, 100,  200m  in  Figure  9.  We  observe  that,  in  order 
to  ensure  SIReff  >  26dB,  a  reuse  factor  of  Rf  =  4  is  needed 
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Fig.  8:  Reuse  of  frequency  resources  for  reuse  factor  Rf  =  3 


Fig.  9:  Effective  Signal  to  Interference  Ratio  SIReff  for  ML  = 
24  x  6  (8  x  8  scenario)  and  ML  =  30  x  6  (32  x  32  scenario; 
dashed)  as  a  function  of  reuse  factor  Rf 

for  S  =  50m,  while  Rf  =  3  suffices  for  S  =  200m.  Plugging 
in  the  per-basestation  sounding  bandwidth  Ws  calculations  in 
Section  VII-C,  we  see  that  the  overall  system-level  channel 
sounding  bandwidth  Ws  x  Rf  for  a  S  =  50m  picocell  is  as 
small  as  8.8124MHz  x  4  =  35.2MHz  and  674.34KHz  x  4  = 
2.7MHz  for  A^,id  =  8,  32,  respectively,  which  is  dwarfed  by 
the  total  available  bandwidth  (2GHz). 

VIII.  Simulation  results 

We  perform  simulations  for  the  two  example  systems  con¬ 
sidered  in  Section  VII  (8x8  and  32  x  32  transmit  arrays). 
We  report  results  for  the  algorithm  proposed  in  Section  V  and 
two  feedback  strategies:  (i)  ‘full’:  users  feedback  the  measured 
virtual  channel  matrix  Y  (M  x  L  matrix;  L  =  6  for  both 
systems)  and  (ii)  ‘svd’:  users  feedback  the  2  dominant  left 
singular  vectors  of  Y,  scaled  by  their  corresponding  singular 
values  (M  x  2  matrix;  one-third  feedback  overhead). 

We  consider  6  mobile  users  moving  in  the  urban  canyon 
at  speeds  of  20,  3,  15,  1.5,  2.1  and  10  meters  per  second 
(covering  both  vehicular  and  pedestrian  settings).  The  height 
at  which  each  mobile  device  is  held  is  in  the  1.3  —  1.4m  range. 
The  basestation  is  mounted  on  a  lamppost  on  the  pavement 
(7  meters  from  a  canyon  wall),  at  a  height  of  6  meters.  The 
basestation  antenna  array  is  tilted  by  about  7.5°  in  both  the 
azimuth  and  elevation  directions  so  that  the  boresight  of  the 
array  points  towards  middle  of  the  corresponding  cell.  This 
helps  in  more  accurate  spatial  frequency  estimation:  since  a 
change  in  direction  near  the  boresight  of  the  array  results 
in  larger  changes  in  spatial  frequencies  than  far  away  from 


Fig.  10:  Six  users  in  the  urban  canyon  moving  over  the 
duration  of  the  7  second  simulation  interval.  Their  positions 
at  time  t  =  0  is  marked  using  a  D-symbol 

the  boresight,  resolving  paths  is  easier  when  the  array  points 
towards  a  direction  in  which  we  are  likely  to  see  more  paths. 
We  do  not  model  blockage  in  these  simulations,  assuming 
that  the  LoS  path  and  the  three  first  order  reflections  are  all 
available.  Our  goal  is  to  estimate  and  track  the  K  =  4  paths 
to  all  6  users. 

Estimation  error:  Let  T  =  {a ;m  :  m  =  1, . . . ,  K}  denote 
the  true  spatial  frequencies  and  P  =  {u>n  :  n  =  1, . . . ,  K} 
denote  the  set  of  estimated  spatial  frequencies.  When  the 
base  station  uses  one  of  the  estimates  in  P,  say  u>,  to 
form  a  beam,  we  do  not  realize  the  full  201og7Vtji£)  dB 
beamforming  gain.  A  measure  of  the  sub- optimality  is  the 
estimation  error  ||u?  —  o>||2,  which  we  normalize  by  the  DFT 
spacing  of  27r/NtjiD  t°  define  the  following  error  metric: 

Aw(m)  =  mm\\u)m  -  u>n\\2  /  (2n/Nt  1D)  ■  (23) 

When  no  true  spatial  frequency  exists  near  an  estimate  i.e, 
when  u>  is  a  “phantom  estimate”,  we  will  quickly  be  able  to 
discard  it  when  we  beamform  in  the  direction  of  and  find 
that  the  mobile  does  not  receive  power  commensurate  to  what 
it  expects  with  the  20  log  Nt^D  beamforming  gain. 

We  plot  the  Complementary  Cumulative  Distribution  Func¬ 
tion  (CCDF)  of  estimation  errors  (23)  for  the  two  systems 
(NtiiD  =  8,  32)  in  Figure  11a  and  the  Probability  Distribution 
Function  (PDF)  of  the  number  of  paths  estimated  K  (correct 
value  is  K  =  4)  in  Figure  lib.  We  have  set  Pfa  =  10“3  tO 
arrive  at  r  used  in  the  stopping  criterion  (13).  From  Figure  11, 
we  see  that  feedback  of  dominant  singular  vectors  is  an 
efficient  feedback  strategy  which  performs  just  as  well  as 
feeding  back  the  entire  matrix  Y,  while  using  only  a  third 
of  uplink  resources. 

Next,  in  order  to  evaluate  the  effect  of  errors  in  spatial  fre¬ 
quency  estimation  on  beamforming  performance,  we  simulate 
a  simple  scenario  in  which  the  transmitter  beamforms  toward 
the  strongest  estimated  path.  Figure  12  shows  the  CDF  of  the 
achievable  beamforming  gain  for  an  8  x  8  array.  While  ideal 
beamforming  requires  adjustment  of  both  gains  and  phases, 
suboptimal  approaches  for  RF  beamsteering  with  severely 
quantized  phase-only  control  (four  phases)  have  been  studied 
in  our  earlier  conference  paper  [25].  We  see  from  Figure  12 
that  if  ideal  beamforming  were  performed  with  our  estimates, 
then  the  SNR  loss  is  less  than  0.3  dB.  If  four-phase  control 
is  used  based  on  our  estimates,  then  the  SNR  loss  is  less  than 
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(a)  CCDF  of  frequency  estimation  errors  with  new  stopping  criterion 
for  Nt,iD  =  8  (left)  and  Nt,iD  =  32  (right) 

Fig.  1 1 :  Two  feedback  strategies  considered:  (i)  M  x  6  matrix 
overhead) 


Fig.  12:  CDF  of  beamforming  gain  achieved  by  an  8  x  8 
antenna  array  for  ideal  and  quantized  beamforming  techniques 

1  dB.  The  results  for  32  x  32  arrays  are  entirely  similar,  and 
are  therefore  not  plotted  here. 

Thus  far,  we  have  not  said  anything  about  channel  frequency 
selectivity.  Our  proposed  algorithm  uses  a  small  segment  of 
the  band  to  estimate  the  spatial  channel,  and  the  problem  of 
channel  dispersion  is  not  addressed.  However,  we  note  that 
beamforming  using  a  large  array  should  reduce  the  effect 
of  undesired  paths,  which  simplifies  the  task  of  equalization. 
Figure  13  shows  the  channel  impulse  responses  for  the  32  x  32 
and  8x8  antenna  arrays  for  a  typical  snapshot,  when  the 
transmitter  beamforms  towards  the  strongest  estimated  path. 
In  our  simulated  setting,  the  LoS  and  ground  reflection  are 
close  to  each  other  in  terms  of  both  delays  and  angles  of 
departure.  We  see  that  8x8  array  fails  to  resolve  them,  with 
both  paths  falling  into  the  antenna’s  main  lobe,  while  the 
32  x  32  antenna  array,  which  has  smaller  beamwidth  (4°  half 
power  beamwidth),  attenuates  the  undesired  tap  down  to  one- 
ninth  of  the  desired  path.  Of  course,  it  is  possible  to  utilize 
the  channel  estimates  far  more  intelligently,  potentially  with 
nulls  directed  both  at  strong  undesired  paths  for  the  mobile  of 
interest,  and  at  the  dominant  paths  for  other  nearby  mobiles. 
The  latter  can  be  particularly  useful  for  combating  intra-cell 
interference  when  a  base  station  face  has  multiple  antenna 
arrays,  each  communicating  with  a  different  mobile. 

IX.  Conclusions 

We  have  shown  that  it  is  possible  to  super-resolve  mm  wave 
spatial  channels  with  a  relatively  small  number  of  compressive 
measurements,  in  a  manner  that  is  compatible  with  coarse 


(b)  PDF  of  #  of  estimated  paths  K  (K  —  4)  with  new  stopping 
criterion  for  Nt,iD  =  8  (left)  and  Nt,iD  =»'32  (right) 

Y  (‘full’)  and  (ii)  top  two  dominant  singular  vectors  (one-third 


Fig.  13:  Channel  impulse  response  with  quantized  beamform¬ 
ing  towards  estimated  strongest  path  for  the  8  x  8  (left)  and 
32  x  32  (right)  scenarios 

phase-only  control  and  RF  beamforming.  This  allows  scaling 
to  a  very  large  number  of  antenna  elements  without  relying 
on  channel  reciprocity.  While  our  discussion  of  system  design 
issues  such  as  link  budget  and  inter-cell  beacon  interference 
is  tailored  to  outdoor  60  GHz  picocellular  networks,  the  basic 
approach  is  broadly  applicable  (e.g.,  to  other  bands,  and  to 
indoor  environments).  An  important  topic  for  future  work 
is  comprehensive  experimental  validation  of  our  compressive 
approach,  which  is  based  on  a  simple  channel  model  including 
only  the  dominant  rays.  At  the  network  level,  there  are  a  host 
of  design  issues  (e.g.,  see  discussion  in  [3]).  The  compressive 
approach  allows  each  base  station  to  build  up  an  inventory 
of  viable  paths  to  nearby  mobiles,  but  there  is  a  huge  design 
space  to  be  explored  on  how  base  stations  coordinate  using 
this  information  to  alleviate  the  effects  of  blockage  (mobiles 
in  urban  environments  can  be  routinely  blocked  by  pedestrians, 
automobiles,  trees  and  other  obstacles),  and  to  manage  inter- 
and  intra-cell  interference.  Optimization  of  arrays  of  subarrays 
in  base  station  “faces”  for  communicating  with  multiple  users, 
as  well  as  for  handling  channel  dispersion,  presents  interesting 
design  challenges. 
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Appendix 


A.  Stopping  criterion 

In  this  section  we  derive  the  CFAR-based  stopping  criterion 
(13).  In  essence,  we  wish  to  determine  when  the  residual 
measurements,  after  subtracting  out  the  contribution  due  to 
the  paths  estimated  thus  far,  can  be  explained  well  enough  by 
noise,  up  to  a  nominal  false  alarm  probability  of  P/a ■ 

Under  the  noise-only  hypothesis  (i.e.,  assuming  all  existing 
paths  have  been  detected  and  subtracted),  we  have 

\l(Vq)K,Zi  zt  ~  CM  (0,  cr2IM)  l  =  l,...,L 


Recall  that  we  use  (9),  maximized  over  an  oversampled  DFT 
grid,  to  detect  a  new  frequency.  For  our  stopping  criterion  (13), 
we  consider  the  maximum  over  a  DFT  grid  (this  simplifies 
analysis  and  design),  and  stop  if  this  is  lower  than  a  threshold, 
as  follows: 


^7-iRS)iF|jKx<")'Vira|2sT  <24) 


The  false  alarm  rate  is  the  probability  that  the  threshold  is 
exceeded  under  the  noise-only  hypothesis: 


Pfa  =  Pr 


max 

U/EDFT 


>  r 


(25) 


The  noise  projections  are  i.i.d.  CJ\f  (0, 1)  across  uo  (due  to  the 
orthogonality  of  the  DFT  basis)  and  i.i.d.  across  measurement 
time  indices  /.  Thus, 


:= 


cr||xH|| 


~CV(  0,1) 


and  the  sum  of  the  squared  magnitudes  across  projections  is 
a  Gamma  random  variable: 


L 

P^UJ  •=  ^2  I2  ~  Gamma  (L,  1) 

i=i 

We  therefore  obtain 

f  T  j  f  T  \  NiD,t 

Pfa  =  P  (.“FT  ^  ^  j  =  1  -  P  i^o  <  ^  j 

=  l-(7  (L,  T/a2)/Y[L))N^ 

where  7  (L,x)  =  f*  tL~1e~t  dt  is  the  incomplete  gamma 
function,  and  T(L)  =  7(L,  00)  is  the  gamma  function.  For 
a  given  Pfa ,  the  threshold  r  is  now  given  by 

r  =  aV1  (i,  r(L)(l  -  Pfa)1/N?^)  (26) 

where  7_1(L,  y)  denotes  the  inverse  of  the  incomplete  gamma 
function  with  respect  to  the  integral  limit  x. 


B.  Complexity  analysis 

We  go  over  the  computations  that  are  needed  to  maintain 
the  estimate  of  one  user’s  channel  over  time  and  point  out 
portions  of  the  computation  phase  that  can  be  reused  across 
users. 


The  first  step  involves  computing  responses  to  each  of  the 
R2N  spatial  frequency  grids  (N  =  N2D  t  as  the  number  of 
transmitter  antennas): 

X  —  {x  (cv)  =  Axt(u>)  :  u>  G  T>}  . 

This  can  be  efficiently  computed  using  the  Fast  Fourier  Trans¬ 
form  in  0(MNR2  log(NR))  time  and  can  be  shared  across 
users.  Furthermore,  the  set  of  responses  A  can  be  stored  at 
the  basestation  and  reused  across  sounding  rounds.  Thus,  the 
cost  of  computing  A  is  amortized  over  users  in  the  network 
and  across  sounding  rounds. 

The  next  step  involves  finding  the  strongest  path  us¬ 
ing  (9)  and  the  corresponding  gains.  This  operation  takes 
0(MLNR 2)  time.  We  then  proceed  to  the  refinement  stage 
which  takes  0(M(L  +  N))  time  ( 0(MN )  to  compute 
dx(u:)/duji  and  O(ML)  for  gradient  and  Hessian  compu¬ 
tation). 

When  multiple  paths  are  present,  each  path  is  refined 
again  for  every  subsequently  detected  path.  Therefore,  the 
overall  complexity  when  a  new  user  joins  the  network  is 
0(K2)  x refinement  complexity + K  x  detection  complexity  = 
0(MK2(L  +  N)  +  KMLNR 2).  In  arriving  at  this  scaling, 
we  use  the  fact  that  our  algorithm  is  residue  centric  and 
therefore  we  keep  track  of  {v^  (Vq)}.  As  a  result  we  can 
compute  {vi(Vq  \  (u>k,  in  0(LM )  time  as  opposed 

to  the  O(qLM)  time  that  would  have  been  needed  if  this 
computation  were  to  be  done  directly  from  {y/}. 

When  we  track  the  channel  using  estimates  from  prior 
rounds,  we  compute  the  gains  {hk,i}  of  the  existing  spatial 
frequency  estimates  from  prior  round  given  by  {u^}  as 
(XhX)~1XhY  and  this  takes  0{KM(L  +  N))  time.  We 
then  refine  these  estimates  using  our  refinement  algorithm  in 
0(KM(L  +  N))  time. 

Paths  that  were  viable  in  previous  sounding  rounds  may 
be  blocked  in  the  current  sounding  round.  To  prune  out 
such  paths,  we  test  whether  the  residual  measurements  after 
the  deletion  of  the  under  question  path  satisfy  our  stopping 
criterion.  This  check  takes  0(KM(L-\-  N)  +MLN)  time  for 
each  path  and  path  pruning  complexity  scales  as  0(K2M(L-\- 
N)  +KMLN).  Checking  whether  a  new  path  exists  (stopping 
criterion)  also  costs  O(MLN)  time.  Therefore,  the  overall 
tracking  complexity  scales  as  0(KM(KL  +  KN  +  LN ))  = 
0(KMN(L  +  K))  (since  N  >  K)  time. 
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